High Agreement and High Prevalence: The Paradox of Cohen’s Kappa

نویسندگان

  • Slavica Zec
  • Nicola Soriani
  • Rosanna Comoretto
  • Ileana Baldi
چکیده

Background Cohen's Kappa is the most used agreement statistic in literature. However, under certain conditions, it is affected by a paradox which returns biased estimates of the statistic itself. Objective The aim of the study is to provide sufficient information which allows the reader to make an informed choice of the correct agreement measure, by underlining some optimal properties of Gwet's AC1 in comparison to Cohen's Kappa, using a real data example. Method During the process of literature review, we have asked a panel of three evaluators to come up with a judgment on the quality of 57 randomized controlled trials assigning a score to each trial using the Jadad scale. The quality was evaluated according to the following dimensions: adopted design, randomization unit, type of primary endpoint. With respect to each of the above described features, the agreement between the three evaluators has been calculated using Cohen's Kappa statistic and Gwet's AC1 statistic and, finally, the values have been compared with the observed agreement. Results The values of the Cohen's Kappa statistic would lead to believe that the agreement levels for the variables Unit, Design and Primary Endpoints are totally unsatisfactory. The AC1 statistic, on the contrary, shows plausible values which are in line with the respective values of the observed concordance. Conclusion We conclude that it would always be appropriate to adopt the AC1 statistic, thus bypassing any risk of incurring the paradox and drawing wrong conclusions about the results of agreement analysis.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing Persian version of Sensory Gating Inventory ‎‎(SGI): Validity and Reliability‎

Introduction: Sensory Gating Inventory (SGI) measures behavioral aspects of Sensory Gating (SG). It is a filtering mechanism of brain that prevents irrelevant sensory inputs from entering into higher cortex information processing. It modifies sensitivity to sensory stimuli. Abnormal SG leads to overloading of information into cortex and brain dysfunction. Electrophysiological techniques cannot ...

متن کامل

On population-based measures of agreement

Measuring agreement between qualified experts is commonly used to determine the effectiveness of a diagnostic procedure. Many methods are available for assessing agreement, including Cohen’s kappa, which is a very popular summary measure of agreement due to its appealingly simple usage and interpretation. However, it has been previously shown that a number of flaws exist in its usage, which can...

متن کامل

A Formal Proof of a Paradox Associated with Cohen's Kappa

Suppose two judges each classify a group of objects into one of several nominal categories. It has been observed in the literature that, for fixed observed agreement between the judges, Cohen’s kappa penalizes judges with similar marginals compared to judges who produce different marginals. This paper presents a formal proof of this phenomenon.

متن کامل

Diagnostic concordance among dermatopathologists in basal cell carcinoma subtyping: Results of a study in a skin referral hospital in Tehran, Iran

Background: Basal cell carcinomas (BCC) are the most prevalent among non-melanoma skin cancers (NMSC), which correspond to the most common skin cancers. BCC histopathological subtyping is a problem in therapeutic management. Therefore, we have decided to perform a histopathologic study for better classification of BCCs based on interobserver diagnostic judgment. Methods: We conducted this cross...

متن کامل

ارزیابی توافق پرسشنامه‌های کتبی و ویدیویی در مطالعه بین‌المللی آسم و آلرژی در کودکان شهر تهران

Introduction: International study on asthma was conducted to study the prevalence of Asthma symptoms among 13-14 year old children using written and video questionnaires during the early 90's. The aim of the present study (ISAAC) was to evaluate the agreement between the two questionnaires which were self-completed by the children. Methods: This study, which was a part of the third phase of In...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2017